On Image Classification: Correlation v.s. Causality

Authors

  • Zheyan Shen
  • Peng Cui
  • Kun Kuang
  • Bo Li
Abstract

Image classification is one of the fundamental problems in computer vision. Owing to the availability of large image datasets like ImageNet and YFCC100M, a plethora of research has been conducted to achieve high-precision image classification, and many remarkable achievements have been made. The success of most existing methods hinges on a basic hypothesis that the testing image set has the same distribution as the training image set (i.e., the i.i.d. hypothesis). However, in many real applications, we cannot guarantee the validity of the i.i.d. hypothesis, since the testing image set is unseen. It is thus desirable to learn an image classifier that performs well even in non-i.i.d. situations. In this paper, we propose a novel Causally Regularized Logistic Regression (CRLR) algorithm that addresses the non-i.i.d. problem, without any knowledge of the testing data, by searching for causal features. Causal features are the characteristics that truly determine whether a specific object belongs to a category or not. Identifying causal features allows us to construct classifiers that adapt to distributional changes in non-i.i.d. circumstances even when the testing set is unseen. Algorithmically, we propose a causal regularizer for causal feature identification and optimize it jointly with a logistic loss term. With the causal regularizer, we can estimate the causal contribution (causal effect) of each focal image feature (viewed as a treatment variable) by sample reweighting, which ensures that the distributions of all remaining image features are close between images with different levels of the focal feature. The resulting classifier is based on the estimated causal contributions of the features, rather than on traditional correlation-based contributions. To validate the effectiveness of our CRLR algorithm, we manually construct a new image dataset from YFCC100M, simulating various non-i.i.d. situations in the real world, and conduct extensive experiments for image classification. Experimental results clearly demonstrate that our CRLR algorithm outperforms the state-of-the-art methods. We further visualize the top causal features selected by our algorithm on our image dataset.
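
The reweighting idea described in the abstract can be illustrated with a short sketch. This is not the authors' implementation: the abstract does not state the exact form of the regularizer, so the code below assumes binary image features, measures "closeness" of the remaining features by the squared gap between weighted group means, and combines it with a sample-weighted logistic loss. The function names and the trade-off parameter `lam` are hypothetical.

```python
import numpy as np


def balancing_penalty(X, w):
    """Assumed form of a causal (balancing) regularizer.

    For each focal feature j (treated as a binary treatment), compare the
    weighted mean of all remaining features between samples with
    X[:, j] == 1 and samples with X[:, j] == 0, and sum the squared gaps.
    """
    n, p = X.shape
    total = 0.0
    for j in range(p):
        treated = X[:, j]              # 1 where the focal feature is present
        control = 1.0 - treated        # 1 where it is absent
        rest = np.delete(X, j, axis=1)  # all remaining features
        wt, wc = w * treated, w * control
        if wt.sum() == 0 or wc.sum() == 0:
            continue                   # one group is empty: nothing to balance
        mean_t = rest.T @ wt / wt.sum()
        mean_c = rest.T @ wc / wc.sum()
        total += np.sum((mean_t - mean_c) ** 2)
    return total


def weighted_logistic_loss(X, y, w, beta):
    """Sample-weighted logistic loss for labels y in {0, 1}."""
    z = X @ beta
    # log(1 + exp(-(2y - 1) z)), computed stably with logaddexp
    return np.sum(w * np.logaddexp(0.0, -(2.0 * y - 1.0) * z))


def joint_objective(X, y, w, beta, lam=1.0):
    """Logistic loss plus lam times the balancing penalty (lam is assumed)."""
    return weighted_logistic_loss(X, y, w, beta) + lam * balancing_penalty(X, w)


# Toy data: in X_bal the two features are independent, in X_unbal they co-occur.
X_bal = np.array([[1.0, 1.0], [1.0, 0.0], [0.0, 1.0], [0.0, 0.0]])
X_unbal = np.array([[1.0, 1.0], [1.0, 1.0], [0.0, 0.0], [0.0, 1.0]])
y_toy = np.array([1.0, 1.0, 0.0, 0.0])
w_uniform = np.ones(4)
```

In this sketch, minimizing `joint_objective` over both the sample weights `w` and the coefficients `beta` would push the weights toward balancing the remaining features across levels of each focal feature, so that `beta` reflects something closer to each feature's causal contribution than its raw correlation with the label.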

Journal:
  • CoRR

Volume abs/1708.06656

Publication date: 2017